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Abstract 

Two intimately related new classes of games are introduced and studied: entropy games 
(EGs) and matrix multiplication games (MMGs). An EG is played on a finite arena by two- 
and-a-half players: Despot, Tribune and the non-deterministic People. Despot wants to make 
the set of possible People’s behaviors as small as possible, while Tribune wants to make it as 
large as possible. An MMG is played by two players that alternately write matrices from some 
predefined finite sets. One wants to maximize the growth rate of the product, and the other 
to minimize it. We show that in general MMGs are undecidable in quite a strong sense. On 
the positive side, EGs correspond to a subclass of MMGs, and we prove that such MMGs and 
EGs are determined, and that the optimal strategies are simple. The complexity of solving 
such games is in NP n coNP. 


1 Introduction 

In recent years, some of us have been working on a new non-probabilistic quantitative approach to 
classical models in computer science based on the notion of language entropy (growth rate). This 
approach has produced new insights about timed automata and languages |l| as well as temporal 
logics Q- I n this article, we apply it to game theory and obtain a new natural class of games 
that we call entropy games (EGs). Such a game is played on a finite arena in a turn-based way, 
in infinite time, by two-and-a- halfl players: Despot , Tribune and the non-deterministic People. 
Whenever Despot and Tribune decide on their strategies cr and r, it leaves a set L(a,r) (an u- 
language) of possible behaviors of People. Despot wants L(a , r) to be as small as possible, while 
Tribune wants to make this language as large as possible. Formally the payoff of the game is the 
entropy of L(a 1 r), with Despot minimizing and Tribune maximizing this value. 

Potentially these games can be used to model hidden channel capacity problems in computer 
security, where the aim of the security policy (Despot) is to minimize the information flow whatever 
the environment (Tribune) does. EGs can also be rephrased in terms of population dynamics, 
where one player aims to maximize the population growth rate, while the other minimizes it; 
applications of this setting to medicine, ecology, and computer security (virus propagation) are 
still to be explored. On the theoretical side, well-known mean-payoff games on finite graphs can 
be seen as a subclass of our EGs. However the purpose of this paper is to explore the theoretical 

‘The support of Agence Nationale de la Recherche under the project EQINOCS (ANR-11-BS02-004) is gratefully 
acknowledged. The results of Section [4] were obtained at the Institute for Information Transmission Problems, 
Russian Academy of Science, by V. Kozyakin at the expense of the Russian Science Foundation (project 14-50- 
00150). 

'Although this term is mostly used for stochastic games, it is also an appropriate description of EGs. 
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setting of EGs, we therefore leave applications and identification of relevant subclasses of EGs for 
further work. 

The second class of objects studied is that of matrix multiplication games (MMGs), which 
came naturally when analyzing EGs and is, in our opinion, novel and interesting on its own. In 
such a game, two players, Adam and Eve, each possess a set of matrices, srf and S, respectively. 
The game is played in a turn-based way, in infinite time. At every turn, the player writes a matrix 
from his or her set. Adam wants the norm of the product of matrices A 1 E 1 A 2 E 2 ■ ■ . obtained to 
be as small as possible (in the limit), while Eve wants it to be as large as possible. Formally, the 
payoff is the growth rate of the norm of the product. 

The main interest of MMGs comes from the observation that, in the case when one of the 
two players is trivial (i.e. his or her set contains only the identity matrix), the game turns into 
the classical, important, and difficult, problem of computing the joint spectral radius or the joint 
spectral subradius of a set of matrices, see 14, l2lj| . Thus, MMGs is a game (or alternating) 


generalization of this problem. It is thus unsurprising that, in the general case, MMGs are even 
more difficult to analyze. We prove that several natural problems for MMGs are undecidable, 
in particular it is impossible to distinguish between games with value 0 and 1 (and thus it is 
impossible to approximate the value of an MMG). 

Fortunately, MMGs have tractable subclasses. We reduce EGs to a particular subclass of 
MMGs (referred to as IMMGs), when the sets and S are so-called independent row uncertainty 
sets of non-negative matrices Q, and show that for this class the game can be solved: it is 
determined, and for each player the optimal strategy is to write one and the same matrix at every 
turn. This result is based on a new, quite technical, minimax theorem on the spectral radius of 
products of the type AB where both A and B belong to sets of matrices with independent row 
uncertainties. We deduce that EGs are determined, and that the optimal strategies for Despot and 
Tribune are positional. A careful complexity analysis of the games considered (EGs and IMMGs) 
allows to prove that comparing their value to a rational constant can be done with complexity 
NP fl coNP. 

The article is structured as follows. In Section [2] we recall useful notions from linear algebra 
and language theory. In Section [3] we formally define the two games and show how they are 
related, we also prove undecidability of general MMGs. In Section 2] we prove the key technical 
minimax theorem for matrices. In Section [5] we prove the main properties of EGs and IMMGs: 
determinacy, existence of simple strategies and complexity bounds. In Section[6]we relate the EGs 
studied here to classical mean-payoff games and a new kind of population games. We conclude 
with a discussion on the perspectives. The Appendix contains proofs of all lemmas. 


2 Preliminaries 

2.1 Some Linear Algebra 

Given two vectors x, y £ BA, we write x ^ y, if Xi ^ yt for each 1 ^ ^ N. Similar notation 

will be applied to matrices. We denote by || • || the 1-norm of vectors and matrices. Note that, for 
non-negative vectors and matrices, ||a;|| = JA x t . 

Let A be an (N x ./V)-matrix. Its spectral radius is defined as the maximal modulus of its 
eigenvalues and denoted by p{A). It characterizes the growth rate of A" for n —>■ 00 : according 
to Gelfand’s formula p(A) = lim^oo ||A ri || 1 / n . The spectral radius depends continuously on the 
matrix, and is monotone for non-negative matrices [131, Corollary 8.1.19]: p(A) ^ p{B) when 
0 ^ A ^ .B. If A > 0, i.e. all the elements of A are positive, then by the Perron-Frobenius 
theorem, the number p(A) is a simple eigenvalue of the matrix A, and all the other eigenvalues of 
A are strictly less than p(A) in modulus. The eigenvector v = {v±,V 2 , ■ ■ ■ ,vn) t corresponding to 
the eigenvalue p{A) (normalized, for example, by the equation Vi = 1) is uniquely determined 
and positive. 

Following Q, given N sets of M-dimensional rows s^i we define the IRU-set (independent row 
uncertainty set) of (TV x M)-matrices that consists of all matrices of the form A = (a^) 
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wherein each of the rows a* = [an, ,..., oim] belongs to the respective srf{,. We will need several 

simple properties of IRU-sets. 

Lemma 1. For an IRU-set srf formed by sets of rows g/i, szi 2 ,..., .c/y the following holds: 

(i) for any matrix B the set &/B = {AB | A £ £/} is IRU as well; 

(ii) the convex hull conv(,c/) is the IRU-set formed by the row sets conv(,e/i),..., conv(^jv); 

(iii) the set £/ is compact if and only if so are all the row sets &/i, 3 / 2 , ■ ■ ■, Jz/n- 

2.2 Joint Spectral Radius and Subradius 

The joint spectral radius 1,0,0 of a bounded set srf of ( N x 7V)-matrices characterizes the 
maximal growth rate of products of n matrices from the set and admits the following equivalent 
definitions (where the identity between the upper and the lower formulas constitutes the famous 
Berger-Wang Theorem Q): 


p(sz/) = lim sup j||di • • • AnW 1 / 71 di e ^/| = inf sup |||di • • • dnll 1 /" d, € f/ j 

= lim sup { p(A\ ■ ■ ■ A n ) l / n A; € jz/\ = sup sup { p(Ax ■ ■ ■ A n ) 1 ^ n A % £ jz/\ . (1) 

n—>• 00 L J ^>1 l J 


For a compact (closed and bounded) set jzf, the suprema in (JT]) may be replaced by maxima. 

The joint spectral subradius 10] , or lower spectral radius , corresponds to the minimal growth 
rate of products of matrices: 


p(sz/) = lim inf j||di • • • A n \\ 1 ^ n Ai £ jz/J = inf inf |||di • • • A n \\ 1 l n Ai £ sz /j 

= lim inf |/o(di • ■ • A„) 1/,n Ai £ jz/J = inf inf |p(di • • • A n ) 1 / ra Ai £ jz/J . 


The equivalence of the characterizations based on norms and on spectral radii is established in [0 
Theorem Bl] for finite sets jz/, and in [3, Lemma 1.12] and 3, Theorem 1] for arbitrary sets jz/. 
Calculating the joint and lower spectral radii is a challenging problem, and only in exceptional 
cases these characteristics may be found explicitly, see, e.g., [lj, |l5[ and the bibliography therein. 
The case of compact IRU-sets of non-negative matrices is such an exception, for which p and p 
admit a simple characterization: as stated in [3 Theorem 2], for such a set srf the following 
equalities hold: 

p{s!l) = maxp(A), p(sz/) = min p(A). (2) 

Ads? 

Compact IRU-sets of non-negative matrices and their convex hulls have another useful property: 
as is shown in [0 Corollary 1], 


max p{A) = 
Aesrf 


max p(A), 

AG conv(^/) 


min p(A) 
Ae#/ 


min p(A), 
AG conv(^) 


(3) 


and hence p(£/) = p(conv(jz /)), p(s/) = p( conv(j^)). 


2.3 Entropy of an w-Language 

The notion of entropy of a language and methods for computing it in the case of regular languages 
were introduced in 0] for finite words and in j0 ] for infinite ones. We will use the latter definition. 
The entropy of an w-language L C YF is defined as 


H (L) = lim sup 

n—>oo 


l°g|pref„(L)| 

n 
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(all the logarithms here are in base 2), where pref n (L) is the set of prefixes of length n of infinite 
words in L. Intuitively, H(L) is the information content (“bandwidth”), measured in bits per 
symbol, in typical words of the language. In particular, Id (E“) = log |E|. 

For a regular L C £“ accepted by a given Blichi automaton, its entropy can be effectively 
computed as follows: compute the (finite) automaton recognizing pref(L), determinize it, and 
compute the entropy as the logarithm of the spectral radius of the adjacency matrix of the au¬ 
tomaton obtained. 


3 The Two Games 

3.1 Entropy Games 

Consider the arena ( D , T, E, A) where D and T are disjoint finite sets of vertices (of two players), 
E a finite alphabet of actions and ACTxExDUDxExTisa transition relation. Given 
such an arena, we define a game with two-and-a-half players: Despot, Tribune and People. The 
latter plays non-deterministically and counts for half a player. People chooses the initial state in 
D. When the game is in a state d of D, Despot plays an action a £ E and the game changes to 
some t £ T (chosen by People) such that (d, a , t) £ A. Then, Tribune plays an action b £ E and 
the game changes its state to d' £ D, again chosen by People and such that (t,b,d') £ A. It is 
again Despot’s turn. The players must not block the game: they always choose an action that has 
a corresponding transition (d, a, •) £ A, or (f, 6, •) £ A, respectively. We assume that the arena is 
non-blocking: at every state there is at least one such transition. Figure |T] shows an example of 
such an arena, which we will use as a running example in this paper. 




Figure 1: Left. Arena of our running example of an entropy game. Circles are states of the Despot 
while squares are states of the Tribune. At each move, the player has to choose between actions 
a and b , the outcome of which may sometimes be non-deterministic (e.g. when Despot plays a in 
state c? 2 , the next state may non-deterministically be either t\ or 1 3 ). Right. A finite play on this 
arena. Despot plays ab (whatever his opponent does) while Tribune plays aa. We only give, for 
each step, the number of words that end up in each state controlled by the active player. 

A play of the EG is a finite or infinite sequence 7 r £ (D ■ E • T ■ E)°° compatible with the 
transition relation A. Note that four letters in a row correspond to one turn of the game. A 
strategy <7 for Despot is a function ( D • E • T • E)* • D — xE that, given any finite play ending in 
a D state, outputs an action taken by Despot. The strategy is positional if it only depends on 
the current state of the game, i.e. it can be expressed just as a(d). A strategy r for Tribune is a 
function (D ■ E • T ■ £)* • D ■ E • T —> E which, given any finite play ending in a T state, outputs 
the action taken by Tribune. The strategy is positional if it only depends on the current state 
of the game. In a natural way we define plays compatible with a Despot’s strategy <7, or with 
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a Tribune’s strategy r. Then, given er and r, we have an w-language L(cr, r) containing all the 
plays compatible with er and r. In other words, L(cr, r) is the set of runs that People can choose if 
Despot and Tribune commit themselves to er and r. What makes EGs different from other games 
(parity/mean-payoff etc.) is that the payoff does not depend on a single run of the game, but 
on the whole set of possible runs. More precisely, the payoff (the amount that Despot pays to 
Tribune) is defined as 

P(ct,t) = limsup|pref 4n (L(cr,r))| 1/n , 

n—too 

that is the growth rate (w.r.t. the number of turns) of the number of plays available to the People 
under the strategies a and r. Note that the payoff is a monotone function of the entropy of L{a, r), 
indeed 

P(ct,t) = 

i.e. Despot tries to diminish the entropy while Tribune aims to augment it. 

3.2 Matrix Multiplication Games 

Let £/ be a set of M x 7V-matrices and S of N x M-matrices. The MMG between two players, Adam 
and Eve, is played as follows: in turn, for every i £ N, Adam writes a matrix Ai £ stf and then Eve 
writes a matrix Ei £ S. Formally, we define a play as an infinite sequence A 1 E 1 A 2 E 2 ■ ■ ■ AiEi... 
with Ai £ and Ei £ S. A strategy for Adam is a function cr : (#/ ■ S)* —» sS that maps any 
finite history (which is a sequence of matrices) to Adam’s next move. Similarly, a strategy for Eve 
is a mapping r : ■ S)* ■ sS —»• S. A strategy is called constant if it does not depend on the 

history, i.e. is given by just one matrix: a = A £ srf or t = E £ S. We define a play compatible 
with a strategy cr (or r) in a natural way. Note that, given a strategy a for Adam and a strategy 
r for Eve, there exists a unique play 7 r(cr, r) compatible with both of them. The payoff of a play 
tv = A 1 E 1 A 2 E 2 ■ ■ ■ AiEi... (that is, the amount that Adam pays to Eve) is the growth rate of the 
norm of the infinite product of matrices: 


P(jv ) = P(<J, t) = limsup 

k —^00 

3.3 General Matrix Multiplication Games are Undecidable 

The difficulty of general MMGs should be compared with results on the difficulty of JSR (joint 
spectral radius) computation. Thus, as proved in [ 5 J, Thm 2], given a finite set S of non-negative 
matrices with rational elements, it is undecidable whether p{S) ^ 1. The decidability status of 
the problem p(S) < 1 is unknown. Finally, it is immediate from the characterization ([1]) that, 
given a precision e > 0, it is possible to compute £-approximation of p(S) (in other words p(S) is 
computable as function of S in the sense of computable analysis, see [23jI. 

Theorem 2. Given a determined MMG with finite sets of non-negative matrices with rational 
elements and a £ Q+, the decision problem for its value V ^ a is undecidable. 

Proof. Let £/ = {Id} (Adam is trivial) and if be a finite set of non-negative matrices with 
rational elements. The corresponding MMG is determined with value V = p{S) and thus the 
decision problem V < 1 is undecidable due to [1, Thm 2], cited above. □ 

To prove stronger undecidability results for MMGs without direct counterparts for the JSR, 
we need a couple of simulation lemmas: for arbitrary matrices and for non-negative ones. 

Lemma 3. Given a two-counter machine M, one can construct two finite sets of integer matrices 
and S such that the corresponding MMG is determined and its value V satisfies: 

“if AI halts (starting with counters containing 0) then V = 0, else V = 1.” 


II" 4, 

i=l 


Ei 
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Lemma 4. Given a two-counter machine M, one can construct two finite sets of non-negative 
integer matrices si and $ such that the corresponding MMG satisfies: 

“if M halts then Adam can ensure payoff < 2, otherwise Eve can ensure payoff ^ 2.” 

In both cases the construction, inspired by 0 , follows the same principle: Eve tries to simulate 
the machine M; if she cheats, then Adam detects this and “resets” the product. Since the halting 
problem is undecidable, we obtain immediately the following two theorems. 

Theorem 5. Given a determined MMG with finite sets of matrices with integer elements 

• its value V is not computable from the matrices; 

• it is not computable even knowing a priori that V £ {0,1}. 

Hence the MMG value cannot be approximated and is not computable (as function of si and 
S) in the sense of computable analysis. 

Theorem 6. Given an MMG with finite sets of non-negative matrices with integer elements, it 
is undecidable whether the maximal payoff that Eve can ensure is < 2. 

3.4 Relations Between the Two Kinds of Games 

Fortunately, as will be shown below, the subclass of MMGs with IRU-sets of non-negative matrices 
is much easier to solve. In this section, we relate EGs to such MMGs. 

Let A = (D,T, E, A) be an arena with D = {di,..., c?m} and T = {ti,..., ijv}- We define 
matrix sets si, S' as follows. For each Despot’s vertex di £ D , and action a £ E we define the row 
Cia = [c»o,i> • ■ ■ j Cia,jv] where Ci a j = 1 if (di,a,tj) £ A and Ci a j = 0 otherwise. Next we define 
the row set si = { Ci a ^ 0 | a £ E} (non-zero rows correspond to non-blocking actions). Row 
sets si \,..., s4m determine an IRU-set of matrices si. The IRU-set S corresponding to Tribune’s 
actions is defined similarly. In the running example in Figure [TJ for instance, the row sets are the 
following: six = {[1,1,0]} , = {[0,1, 0], [1, 0,1]} , si 3 = {[0,1,1]} , = {[0,1, 0], [1, 0,0]} , S 2 = 

{[ 1 , 1 , 1 ]}, = {[ 0 , 1 , 0 ], [ 0 , 0 , 1 ]}. 

Note first that there is a natural bijection between the positional strategies of Despot and the 
set si : any positional strategy cr : D —> E corresponds to the matrix A a £ si with i-th row Cj 
for Adam. Similarly, a positional strategy of Tribune r corresponds to Eve’s matrix E T £ S. The 
following lemma generalizes this observation to any type of strategies: 

Lemma 7. Let A be an arena and si,S the corresponding IRU matrix sets. Then for every pair 
of strategies (cr, t) of Despot and Tribune in the EG on A there exists a pair of strategies (?, 0) of 
Adam and Eve in the MMG (conv(^), conv(d’)) with exactly the same payoff. Moreover, if a is 
positional, then <; is constant and permanently chooses A a . The case of positional r is similar. 

Note that Lemma [7] provides a rather weak relation between two games and does not mean, by 
itself, that the two games have the same value. However, we will show later (cf. Lemma fl5l) that 
optimal constant strategies in the MMG that belong to si and $ are in bijection with optimal 
positional strategies in the EG. 

4 Minimax Theorem for IRU-Sets of Matrices 

In this section, we prove the key theorem of this article. 

Theorem 8. Let si be a compact IRU-set of non-negative (N x M)-matrices and LH be a compact 
IRU-set of non-negative (M x N)-matrices. Then 

min maxp(dB) = max min p(AB). (4) 
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In the rest of the article we will denote this minimax by J 1 ). The study of minimax 

relations will be based on the following well-known fact: 


Lemma 9 (see [22|, Section 13.4]). Let f{x,y) be a continuous function on the product of compact 
spaces IxY. Then 


min max f(x, y) ^ max min f{x,y). 

x y y x 


The exact equality holds if and only if there exists a saddle point, i.e. a point (xo,yo) satisfying 
the inequalities 

f{xo,y) < f{xo,yo) < f(x,yo) 


for all x £ X, y £Y. 


We will also use two lemmas on matrices. The first one provides spectral radius bounds and is 
quite standard in Perron-Frobenius theory; as usual in this theory it relates global characteristics 
of a non-negative matrix (such as spectral radius) with its behavior on one non-negative vector. 

Lemma 10. Let A be a non-negative (N x N)-matrix; then the following properties hold: 

(i) if Au ^ pu for some vector u > 0, then p ^ 0 and p{A) ^ p; 

(ii) if furthermore A > 0 and Au ^ pu, then p{A) < p; 

(iii) if Au ^ pu for some non-zero vector u ^ 0 and some number p ^ 0, then p{A) ^ p; 

(iv) if furthermore Au ^ pu, then p{A) > p. 


The next lemma concerning IRU-sets of matrices is new and can be explained as follows. For 
an IRU-set of matrices and two vectors u and v we imagine that the sets Bi = {x : x ^ v} and 
B u = {x : v Sj a;} form the lower and upper bulbs of an hourglass with the neck at the point v. 
The lemma asserts that either all the grains Au (for all matrices A in the set) fill one of the bulbs, 
or there remains at least one grain in the other bulb. Clearly this alternative does not hold for 
general sets of matrices. 

Lemma 11 (hourglass alternative). Let srf be an IRU-set of (N x M)-matrices and let Au = v 
for some matrix A £ srf and vectors u,v. Then the following holds: 

(i) either Au Js v for all A £ sY or exists a matrix A £ sY such that Au ^ v and Au ^ v; 

(ii) either Au St v for all A £ sY or exists a matrix A £ sY such that Au ^ v and Au ^ v. 


We are ready to prove the minimax theorem. 

Proof of Theorem [21 According to Lemma [9l the minimax equality (j4j) may occur if and only if 
some matrices A £ sY and B £ S3 satisfy the inequalities 


p(AB) ^ p(AB) for all B £ (5) 

p{AB) < p{AB ) for all A £ sf. (6) 


Consider first the case when all the matrices in sY and S3 are positive. To construct the 
matrices A £ zY and B £ S3 we proceed as follows. For each B £ S3 let Ab £ be a matrix that 
minimizes (in A) the quantity p(AB). Such a matrix Ab exists due to compactness of the set 
and continuity of the function p{AB) in A and B. Then, for each matrix B £ S3, the relations 

plAsB) = min p(AB) ^ p(AB) 


hold for all A £ xY. Let B be the matrix maximizing minp(AB) over the set S3, and let 
A = Ab. In this case 


ma xp(AsB) = max min p(AB) = min p(AB) = p(AbB) = p(AB), (7) 

v ’ Ae,sW v ’ B > j, \ j 
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which implies inequality © for all A € #/, and it remains to prove (O for all B £ S3. 

Let v = (v±,V 2 , ■ ■ ■ ,v N ) T be the positive eigenvector of the (N x jV)-matrix AB corresponding 
to the eigenvalue p = p(AB). By denoting 

w = Bv G K m 

we obtain that pv = Aw. Let us show that in this case 

pv ^ Aw for all A e srf■ (8) 

Otherwise, by Lemma m) there would exist a matrix Aerf such that pv ^ Aw and pv yA Aw , 
which implies, by the definition of the vector w, that pv ^ ABv and pv yA ABv. Then by 
Lemma [TU] 

p{AB) < p = p{AB ), 

which contradicts ©. This contradiction completes the proof of inequality 0. Similarly, now we 
show that 

w ^ Bv for all B € S3. (9) 

Again, assuming the contrary, by Lemma |1 If ii) there exists a matrix B £ S3 such that w sj Bv 
and w y Bv. This last inequality, together with © applied to the matrix Ag, yields pv < AgBv 
and pv yA AgBv. Then by Lemma [TUI 

P < p{AgB), 

which contradicts 0 asserting that p = p(AB) is the maximum value of the function p(AgB) 
over all B £ S3. This contradiction completes the proof of inequality 0. 

From pv = Aw and 0 we obtain the inequality pv ^ ABv valid for all B £ S3, which by 
Lemma [TUI implies the relations 

p{AB) = p > p(AB) 

valid for all B £ S3, or, which is the same, inequality The theorem is proved for positive 
matrices. 

Consider now the general case of compact IRU-sets of non-negative matrices srf and S3. If 
the set si is determined by some sets of M -rows sti, i = 1,2,..., AA, then choose an arbitrary 
e > 0 and consider the sets of rows 

= {a ( ' e) | = a + e[l, 1,..., 1], a G M}, 

where i = 1,2,..., N. In this case the IRU-set of matrices ,sS l ' F ' 1 consists of positive matrices 
A + e 1, where A £ st and 1 is the matrix with all elements equal to 1. Define similarly the 
IRU-set of matrices 33^. 

By the result just proved, for each e > 0 the minimax equality holds for positive matrices: 

min max p(AB) = max min p(AB), 

A^si^ BeSS^ 

which by Lemma [9] is equivalent to the existence of A e £ and B e £ S3 such that 

p{{A e + sl)(B + el)) sC p((A £ + el )(B e + el)) ^ p{{A + el )(B e + el)) 

for all A £ s# and B £ S3. Taking here e = e n , where {e n } is an arbitrary sequence of positive 
numbers converging to zero, we get 

p((A en + e nl)(B + e n l)) ^ p((A Sn + s n l)(B Sn + e n l)) < p((A + s n l)(B en + e n l)) (10) 

for all A £ s/ and B £ S3. Without loss of generality, in view of the compactness of the sets s# and 
S3 , we may assume the existence of matrices A and B such that A £n -A A £ srf and B £rl -A B £ S3 
as n —> oo. Then turning to the limit in (1101) . we obtain the inequalities 

p(AB) S p(AB) < p(AB) 

for all A £ $/ and B £ S3, which are equivalent to © and ©• This concludes the proof. □ 
Corollary 12. For IRU-sets stf and S3 of non-negative matrices it holds that 

mm(conv(^/), con v(33)) =mm (s/,33). 


5 Solving the Games 

5.1 Solving Matrix Multiplication Games for IRU-Sets 

Theorem 13. Let srf and S’ be compact IRU-sets of non-negative matrices. Then the correspond¬ 
ing MMG is determined, and moreover Adam and Eve possess constant optimal strategies. 


Proof. Let us apply Theorem [8] to matrix sets and S'. Define V, Eq and Aq such that 

min p(EAo) = max min p(EA) = min max p(EA) = max p(EnA) = V. 

Ee£ Aerf E^e E^S At^si Aejrf 


( 11 ) 


Let Adam only play Aq. Take any compatible play n = AqE-\ AqE^ ■ ■ ■ and put C% = A$Ei. Denote 
‘if = {EAq\E € S'}; it is an IRU-set by Lemma [TJ The payoff P for n yields 

P = limsup ||A„Ci • • • Cn-xEnW 1 ^ < limsup(||Ao|| • ||Ci • • • C n - 1 || • ||£„||) 1/n 

n—¥ oo n—y oo 

< lim K~ limsup IICi ■ ■ ■ C n ~ill < p(f&) = max p(C) = max p(EA 0 ) = V ., 

ti y oo yi y oo G E £ 

where the constant K is an upper bound for the norms of the matrices in stf and S, equality 1 
comes from the first equality © and equality 2 comes from (fill) . 

Let Eve only play Eq. Take any compatible play n' = A1E0A2E0 ■ ■ ■. Let us write Di = AiE$. 
Denote TA = {AEq, A € it is an IRU-set. The payoff P' for 7 r' is such that 

P' = limsup 11 Ci • • • C n \\ 1/n > lim inf ||Ci • • • C rl || 1 / n > p{IA) = min p{D) = min p(AE 0 ) = V, 

1 %—^00 fOG) D A £ 

where equality 1 comes from the second equality @ and equality 2 from (fill) using the equality 
p{EA 0 ) = p(AqE). 

We have proved that Adam (by constantly playing Ao) can ensure payoff V whatever Eve 
plays; and that Eve (by constantly playing Eq) can ensure payoff ^ V whatever Adam plays. This 
concludes the proof. □ 

Corollary 14. Let sA and S be compact IRU-sets of non-negative matrices. In the MMG on 
conv(.sz/), conv(fo’), the constant optimal strategies can be chosen from sets sA and S. 

This follows immediately from the proof of the theorem and Corollary [U21 


5.2 Solving Entropy Games 

In this section, we consider an EG on an arena A and the corresponding matrix sets sA and $, as 
defined in Section ITTTTl 


Lemma 15. Let [a, r) be two positional strategies in the EG. Then, if corresponding constant 
strategies A„ and E T are optimal for their respective players in the MMG with matrix sets conv(^) 
and conv(d’), then so are cr and r. 

Theorem 16. Every EG is determined, and Despot and Tribune possess positional optimal strate¬ 
gies. 

Proof. From Theorem fl3l we know that for the MMG (conv(^), conv(d’)) both Adam and Eve 
possess optimal strategies, which consist in constantly playing some matrices A and E. From 
Corollary [Ill the matrices A and E can be chosen from sets srf and S, respectively. Then, there 
exist positional strategies a and r on A such that A = A a and E = E T . By Lemma [151 strategies 
a and r are optimal in the EG. □ 
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Back to the running example. Here a quick exploration of the combinations of rows shows 
that the matrices realizing the minimax over the two IRU-sets defined by row sets «g^i, ^ and 

for Adam/Despot and E = 


, $ 2 , are A — 


110 
1 o 1 

Oil 


10 0 
ill 
0 0 1 


for Eve/Tribune. These matrices 


describe both the optimal constant strategy of the MMG and the optimal positional strategy of the 

/ 2 1 1 ” 

EG induced by this arena. The value of both games is the spectral radius p(AE) = p ( 


(y/17 + 3) /2 ~ 3.562. 


ami) 


5.3 Complexity Issues 

We will analyze the complexity of solving matrix multiplication (and hence entropy) game. We 
start with necessary and sufficient conditions for inequalities on joint spectral radii and subradii 
of IRU-sets (recall also © relating them to maximal and minimal spectral radii). 

Lemma 17. For any compact IRU-set of positive matrices srf and a G Q+ the following equiva¬ 
lences hold: 


p(srf) <o«3®> OVA G srf(Av < av)-, (12) 

p(srf) < a <t=> 3v > OVA G £/(Av ^ av)\ (13) 

p(s?) > a <t=> 3v > OVA G st(Av > av ); (14) 

p(£/) ^ a <^3v > OVA G sf(Av ^ av). (15) 

If the matrices are only non-negative, the equivalences HU above and (HU) below hold: 

p(srf) ^ a <t=> 3(n > 0, v ^ 0) VA G srf(Av ^ av). (16) 


The computational aspects of calculating the values p(£Sf) 
negative matrices, based on relations m, are discussed in 


for IRU-sets of non- 
These articles provide 

polynomial algorithms for approximation of the minimal and maximal spectral radii, as well as 
a variant of the simplex method for these problems. In the next theorem we prove a complexity 
result in a form suitable for game analysis. 


Theorem 18. Given a finite IRU-set of nonnegative matrices srf with rational elements (rep¬ 
resented by row sets srf\, sAi, ..., fa), and a number a G Q+, the decision problems whether 
p(srf) < a and whether p(srf) ^ a belong to the complexity class P. Moreover, if the matrices are 
positive, then the decision problems p(srf) ^ a and p(srf) > a are also in P. 


Proof. The polynomial algorithms are based on the previous lemma. Consider the problem of 
deciding p(s^) < a, which can be rewritten using (fl2l) as > OVA G stf(Av < av). We will 
not test all the matrices A G V (there are exponentially many of them); instead, we will treat 
each row separately. The condition VA G stf(Av < av) can be rewritten as a system of linear 
inequalities: for each i and for each row [c±, C 2 , ..., cn] G M require that 


c\v\ + c 2 v 2 + • • • + cnvn < avi. 


The condition v > 0 can be written as N inequalities 'ty > 0: one for each coordinate. Using a 
polynomial algorithm for linear programming we can decide whether a solution v satisfying all 
these linear inequalities exists. 

All other decision procedures, based on HU HU. are similar. The condition v ^ 0, v ^ 0 can 
be represented as a disjunction of N linear systems Vj > 0 A /\f =1 ih >0. □ 

Theorem 19. Given two finite IRU-sets of nonnegative matrices srf and AS with rational elements, 
and a number a G Q+, the decision problem of whether mm(s/,A§) < a belongs to NP fl coNP. 
Moreover, if the matrices are positive, then the problem of whether mm(jV, AS) ^ a is also in 

NP fl coNP. 
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Proof. Consider the problem of deciding whether mm(,e/, 88) < a, which can be rewritten as 

min ma xp(BA) < a, 

or equivalently 

3A 0 £ s8(p(8SA 0 ) < a). 

The nondeterministic polynomial algorithm proceeds as follows: 

• guess non-deterministically a matrix Aq £ 

• compute the representation of S3A 0 as an IRU-set generated by the row sets ^l, ^ 2 , • • •, c £>.' n ; 

• check the inequality /3(^Ao) < a in polynomial time using Theorem 1181 

We conclude that the problem 8 $) < a is in NP. The complementary problem mm (j8 , 88) ^ a 

is also in NP, as it can be rewritten as 

max min p(AB) ^ a, 


or equivalently 

3Bq £ 83{p{s8 Bo) ^ a), 

and decided by a non-deterministic polynomial algorithm similarly. We conclude that the two 
problems belong to NP fl coNP. 

For positive matrices, the proof for the other decision problem based on the second statement 
of Theorem IT8l is similar. □ 

Our main complexity result follows immediately. 

Theorem 20. Given an EG or an MMG with finite IRU-sets of non-negative matrices with 
rational elements and a £ Q+, the decision problem for its value: V < a is in NP 0 coNP. 

6 Related Models 

6.1 Weighted Entropy Games 

Up to now we have considered entropy games with simple transitions, but it is straightforward 
to add multiplicities (weights) to them. A weighted entropy game is played on a weighted arena 
A = (Z), T, E, A, w) with a function w : A —> N+ assigning weights to transitions (informally 
a weight is the number of ways in which a transition can be taken). Strategies and plays are 
defined as in the unweighted case. Let L be some set of (infinite) plays. For every u £ pref(L) 
we define its weight w{u) as the product of weights of all the transitions taken along u. We define 
w n (L) = X) uepref (l) w ( u )i an d finally the payoff corresponding to strategies a and r of two 
players is defined as: 

P = limsup ( w n (L(a , t))) 1 ^ . 

n—> 00 

Our main results on EGs (Thrns fl6l and l20l) extend straightforwardly to weighted EGs. 

6.2 Mean-Payoff Games 

Well-known mean-payoff finite-state games (MPG) [11] can be considered as a deterministic sub¬ 
class of weighted entropy games. A (variant of) MPG is played on arena (H,T, A,w) with tran¬ 
sition relation ACflxTUTxfl and weight function w : A —x N. The play starts in some state 
do £ D : and the two players choose transitions in turn. The resulting play is an infinite word 
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7 d 0 £ {D ■ T) u . The mean-payoff corresponding to the play "fd 0 = do, to, d\, ti,... is the limit of 
the average weight of transitions taken: 

1 n 

m P(7do) = limsup - '^2(w(d i -i,t i -i) + w(U-i,di)). 

n—»• oo 71 - , 

l—l 

Finally, player D wants to minimize and player T to maximize the payoff max^gc mp( 7 d 0 ). As 
proved in ll|, MPGs are determined and their optimal strategies are positional. As for complexity, 
}24( | shows that testing whether the value of an MPG is smaller than a rational a is in NP fl coNP 
and becomes polynomial for weights presented in the unary system. 

An MPG A = (D,T, A,w) can be transformed into a weighted EG A' = (D, T, E, A', w') 
as follows. The states of both players are the same, E is large enough, and for each transition 
( p, q) £ A there is a corresponding transition ( p, a, q) £ A' with some a (occurring only in this 
transition). Its weight is w'(p,a,q) = 2 w ( p,q \ We notice that the EG obtained is deterministic: 
due to unique transition labels for any strategies a and r, the language L(a, r) contains one play 
for each initial state. Strategies and plays of both games A and A' are now in natural bijection 
and the payoff of A equals the logarithm of the payoff of A'. 

This way, we obtain the classical results that MPGs are determined and both players have 
optimal positional strategies. Due to the exponential encoding of payoffs, the complexity obtained 
using our approach is, however, not as good as using direct algorithms, see [2~l[ . 

6.3 Population Dynamics 

Consider an EG with arena A = {D,T, E, A). It can be interpreted as the following population 
game between two players, Damien and Theo. Elements of D and T correspond to species (forms 
of viruses, microorganisms, etc.). Initially there is one (or any non-zero number of) organism(s) for 
each species in P. At his turn Damien chooses an action a £ E and applies it to each organism. 
An organism of species d, when subject to action a, turns into the set of organisms of species 
{t | ( d , a, t) £ A}. Theo plays similarly. The aim of Damien is to minimize the growth rate of the 
population, while Theo wants to maximize it. The value of the game and the optimal (positional) 
strategies are the same as for the EG. 

7 Conclusions 

We have introduced two (closely interrelated) families of games: entropy games played on finite 
arenas (graphs), and matrix multiplication games. The main result is that entropy games are 
determined and optimal strategies are positional in EG, while MMGs for IRU-sets of non-negative 
matrices are determined and optimal strategies are constant. These results are based on a new 
mininrax theorem on spectral radii of products of IRU-sets of matrices. The results obtained prove 
the existence of equilibria in zero-sum games with a new type of limit payoffs, which is neither 
computed on a single play of the game nor probabilistic. On the other hand, they rely upon and 
generalize important results on the computability of joint spectral radii and subradii, an important 
problem in switching dynamic systems. 

A presumably straightforward extension would be the “probabilization” of our game models, 
in that both Despot and Tribune would be allowed to play randomized strategies. The minimax 
theorem ensures the existence of optimal pure strategies for both players. However the entropy- 
based payoff of the game needs to be given a proper generalization to this probabilistic setting. We 
may mention that such a generalization could be seen as entropy games on stochastic branching 
processes, and provide interesting links with this research domain. Finally, both our games are 
turn-based games with perfect information. The first generalization to be considered is to go 
to concurrent games — where perhaps some polynomial-size memory is needed, similarly to the 
classic case of concurrent games played on graphs in infinite time. The more difficult case is that 
of games of imperfect information: corresponding matrix games no longer have a simple structure 
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(independent row uncertainty), and we conjecture that analysis of such games is non-computable. 
Last but not least, potential applications sketched in the introduction should be addressed. 
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A Proofs 

Proof of Lemma |T| 

Proof. (i) Let ^ be the set of admissible i-th rows in 


^ ^ O'k^kj 


.k=l 


- 


a £ g/i 


and £% be the IRU-set made from sets 3%i . One has that g/B = £%\ 

• if M £ J then let aW g ^ be such that the i-th row of M is 
then M = AB where A is the matrix made with rows ai\ 

• conversely, if A £ g/ and is the i-th row of A, then the i-th row of AB equals 


E Tl (1)7 

fc=1 a k bkj 




E u ( i) 1 

fc= 1 a fe ko 




and belongs to Mi- 


(ii) The easy direction is C. Let M be a matrix of conv(^). Then, there exist matrices 
Mi,..., Mu £ grf and real numbers Ai,..., Afc such that 


M=J2 AiMi. 


Let j be an integer in {1 ,,n}. For all i £ (1 ,,n}, there exists a vector U; £ g/j such 
that row j of Mj is w,. Then, row j of M being f , A^Uj, it belongs to conv g/j. 

For the direction D, let M be a matrix of the IRU-set formed by conv(^i),..., conv(g/ n ). 
Let ui,...,u n be the rows of the matrix M. By definition of M, there are integers ki 
for i £ {l,...,n}, real numbers A*- £ [0,1] and vectors v} £ g?i for i £ {1 and 

j £ {1,..., ki} such that 

ki ki 

- E A >:; and E A l = L 

t=i i=i 

Then, for all i £ {1,..., n}, one has: 


= E44 = in£ 4 ] (E A : 


li =1 


. !=11j=1 


u»=i 



n fc( \ fci 

IT E a iJ = E---E (II A i 


i=i+lll=l 


li=l 


!»=1 \!=1 


Hence 



Ul 

ki 

k n / n \ 


M = 

. U " 

= E- 

ji=i 

•e ik 

In =1 V=1 / 

. . 


each matrix in the sum being in g/. The proof is finished stating that 


ki k n n 

E ■ ” E II X n 


1 1 = 1 ln = l /=1 


n fc; 

n e * 5 , 


i=i ii=i 


= 1 . 


(iii) Immediate from the characterization of compact sets (of finite dimension) as bounded and 
closed. □ 
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Proof of Lemmas [3] and [4] 

In both lemmas, we announce that it is possible to reduce the halting problem of a 2-counter 
Minsky machine (2CMM) to the threshold problem for MMG. 

So let us first remind the reader about 2CMMs. Such a machine can be defined as a set of 
instructions, indexed by a finite set of states Q , with q 0 £ Q marked as initial, operating on two 
non-negative integer counters x and y. There are three types of instructions (g are states in Q , 
and c is either x or y): 

1 . qp. increment c then execute qj ; 

2 . qi: if c = 0 execute qj , else decrement c then execute qk ; 

3. qp stop. 

The computation starts from instruction qo , executing it and thus triggering a sequence of in¬ 
structions, which may be finite, if it eventually reaches a stop instruction, or otherwise infinite. 
Whether or not the execution will be finite is undecidable. 

Obviously, both reductions consist in encoding any 2CMM into an MMG, the payoffs of which 
depend on whether the machine halts or not. 

Now let us describe the encoding used in Lemma [3] Here the 2CMM is translated into two 
sets £/ and $ of square matrices of dimension \Q\ + 5. States of the 2CMM (discrete location 
and counter values) are encoded, along with some other information, as (row) vectors of the 
space on which these matrices operate. The \Q\ first coordinates of such a vector, labelled with 
q 0 , ■ • •, 9 |q|-i, take a non-zero value only for the current state of the simulation. The two next 
coordinates x and y represent the two counters. Finally, there are three additional coordinates: 
One, E and Neg (the role of which will be explained later on). 

Eve’s matrices allow her to simulate the machine execution (as long as it goes on). The set S 
consists in exactly one matrix per transition of the 2CMM (warning: instruction 2 consists in two 
different transitions, depending on the test c = 0, while instruction 3 consists in no transition, 
i.e., a state with stop instruction is a deadlock state). For the sake of presentation, we describe 
them below as sets of assignments of variables, but it is easy to see that all assignments actually 
are linear operations: 

• matrices I qq ' c (as Increment ): q := q — One ; q' := q' + One', c := c+ One; 

• matrices K q p c (as Keep current counter value): q := q — One; q' := q' + One; c := — c; 

• matrices D qq f c (as Decrement ): q := q — One; q' := q' + One; c := c — One. 

Notice that matrix K qq ' c should be normally applied when c = 0, and thus the operation c = —c 
does not harm. On the contrary, if it is applied illegally, for a positive counter value, then it results 
in a negative c. 

Adam has five kinds of matrices, which he can use to detect whenever Eve does not faithfully 
simulate the machine, and then punish her by forcing a payoff of 0. Here is the set srf: 

• the matrix Init (initialize the 2CMM): q 0 := E; q ^o := 0; x := 0; y := 0; One := E; 

Neg := 0 

• the identity matrix Id (do nothing and just let Eve continue playing); 

• the matrices F c ( flash and take a picture of coordinate c) for c corresponding to a state or 
a counter: Neg := c; 

• the matrix A ( adjust the value of Neg): Neg := Neg + One; 

• the matrix P ( punish Eve by assigning 0 to E): E = E + Neg. 

Now, in order to prove Lemma [3l it suffices to prove the two following sublemmas: 
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Lemma A.l. The MMG obtained by the translation above from a non-halting 2CMM is deter¬ 
mined with value 1, i.e. it has the following properties: 

1. there exists a strategy of Adam a o such that for any strategy r of Eve, P(cto,t) ^ 1; 

2. there exists a strategy of Eve r 0 such that for any strategy a of Adam, P(cx, r 0 ) > 1. 

Lemma A.2. The MMG obtained by the translation above from a halting 2CMM is determined 
with value 0, i.e. it has the following properties: 

1. there exists a strategy of Adam a o such that for any strategy r of Eve, P{gq,t) ^ 0; 

2. there exists a strategy of Eve tq such that for any strategy a of Adam, P(a, tq) ^ 0. 

Proof sketch of Lemma \A.l[ 1. Let the strategy <jq consist in always playing identity. Then 

max T P(ao,r) = piS). It is easy to see that applying a matrix of $ to a vector only changes 
the value of coordinates labelled by a state or by a counter, and only modifies it by adding 
or removing the value of coordinate One (its value is left unchanged). Thus, for any vector 
v and any sequence of matrices of S: E\, ..., E n , all coordinates of vector vE\ ■ ■ ■ E n are 
bounded in absolute value by n-vone, which means that \\E\ ■ ■ ■ E n \\ ^ k , and thus /5(#) ^ 1. 

2. Assume tq is as follows: Eve always stores in a variable t the last time Adam played Init 
(initially t = 0). Then at turn i, she plays the matrix that corresponds to the {i — i)-th 
transition of the execution of the 2CMM. 

We fix a non-negative vector 

vo = (go = L= 0, x = 0, y = 0, E = 1, One = 1, Neg = 0) 
and prove by induction the following invariant on the vector v n = V 0 A 1 E 1 A 2 E 2 ■ ■ ■ A n E n : 

• all coordinates of v n are non-negative; 

• v n E > 1. 

Indeed applying a matrix of srf while respecting the rules of the 2CMM ensures that state 
and counter coordinates remain non-negative, while One,E and Neg remain unchanged. 

On the other hand Adam’s matrices are all non-negative, implying the first bullet of the 
invariant, and can only modify E by adding the value of Neg , which, by the invariant, was 
non-negative at the previous step. 

The above proves the invariant which implies the second item of the lemma. □ 

Proof sketch of Lem.m.a \A.2\ . 1. For ao, we consider the following strategy: 

• first play Init: Adam initializes a simulation of the 2CMM in his private memory in 
the form of a vector vq = (go = 1, Qi^o = 0,x = 0,y = 0,E=l, One = 1, Neg = 0), on 
which all subsequent matrices will be applied (yielding 14 , i> 2 , ■ • •); 

• then play Id as long as Eve plays valid transitions of the 2CMM; 

• play F c as soon as Eve plays an invalid move (if Eve lied on a counter value, then c is 
the name of this counter; if Eve lied on current state, then c is the name of this state; 
in both cases this corresponds to a negative coordinate); 

• play A until v n Ne9 = —1; 

• finally play P (nulling E) and a last time Init (nulling the whole vector). 

Explanation: the invariant from Lemma IA.ll holds as long as Eve simulates the 2CMM. 
When she stops simulating, the value of E is still 1 but some coordinate c is negative. The 
ending sequence F c A ■ ■ ■ AP Init forces the final vector to be 0, no move of Eve can then 
prevent this from happening, as she cannot modify One,E or Neg. 

Now remark that, for any vector v such that v E = 1, it holds that v ■ Init = uo and thus 
v ■ = 0 where f l is the product matrix for the whole play until Adam plays a last time Init. 

This proves that as much yields for any initial vector, thus = 0 and therefore P(uo,t) = 0. 
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2. The payoff function of an MMG is always non-negative. 
Proof of Lemma [H It directly follows from Lemmas IA.1I and IA.2I 


□ 


□ 

Proof of Lemma [^] General idea: Here, in order to use only non-negative matrices, we introduce a 
slightly different construction. Indeed, previous encoding relied on the fact that when Eve cheats, a 
negative coordinate appears that Adam can use to punish her (by nulling the product matrix with 
clever additions using the negative integers). Now there is no hope to create a negative element in 
the product matrix (since the matrices of the MMG are non-negative), so the punishment will be 
less drastic: Adam will just try to obtain a product that grows more slowly than the product of 
matrices corresponding to a faithful simulation. For this Adam needs to reset the game infinitely 
often, as to force Eve to cheat as many times, within a bounded horizon. 

The encoding uses the following idea: a counter of value k is encoded at time n by a coordinate 
of value 2 n+k . This way, a counter decrement consists in keeping its coordinate value unchanged, 
while a counter increment consists in multiplying its coordinate by 4. A counter stalling at a given 
step still sees its coordinate multiplied by 2. 

The vector space: Matrices are square of dimension |Q| +4. They act on vectors such that 
their \Q\ first coordinates represent the states of the 2CMM (positive value only in the coordinate 
corresponding to the active state of the simulation) and the four other coordinates x+, x_, y+, j/_, 
are the two counters and their opposite (i.e. their value will be 2 n ~ c instead of 2 n+c ). 

Eve’s matrices: In this game too, Eve tries to faithfully simulate the 2CMM. Her matrices are 
the following: 

• matrices I qq > x : q' := 2 q; q := 0; x + := Ax+; y + := 2 y + ; y_ := 2 y_\ 

• matrices K qq>x : q' := 2 q; q := 0; (x+,X-) := 2 (x-,x + ); (y+,y-) := 2(y_,y + ); 

• matrices D qq > x : q' := 2 q; q := 0; X- := 4x_; y + := 2y + ; y_ := 2y_; 

• matrices I qq ' y , K qq i y and D qq > y are defined likewise. 

Notice that the coordinate inversion of the previous construction, for the case of a successful 
x = 0 test, is now translated as a coordinate swap between x+ and X-. Thus when Eve cheats on 
a counter value, be it one way or the other, x+ has a value smaller than 2 n . 

Adam’s matrices: 

• matrix Id; 

• matrices P x (and P y ): q 0 := x+; q# 0 := 0; X- = x + ; y + = x+; = x + ; 

• matrices P q : let s = q then qo := s; q,^o := 0; x+ = s;x- = s; y+ = s; y- = s. 

Both Pi x>y \ and P q reset the simulation, forcing the copied value as the new norm for the product 
matrix. 

Adam’s strategy consists in playing Id most of the time; playing P q whenever Eve cheats on 
the state, implying a null product; and playing P c whenever Eve cheats on a counter c € {x, y} 
value, implying a factor of norm 2-f -1 since the last time when Adam played a P matrix (factor 
of length /). 

Since Eve needs to cheat with a positive frequency if the run of the 2CMM is finite, then the 
final payoff will be < 2 (from the product of such factors, which are of bounded length). 

If the run is infinite, whether Adam plays Id or a P, there will be some coordinate that remains 
of magnitude A 2". 

Details of the proof are similar to those of Lemma [5] We prove determinacy, with a value of 
2, in the case when the 2CMM has an infinite run. For the other case we prove that Adam can 
ensure a payoff < 2, but determinacy remains an open problem. □ 
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Proof of Lemmas [71-1171 

Proof of Lemma [7[ Assume D = {d \,..., g^m} and T = {ti,..., t n}- Given arbitrary strategies 
(ct, t) for the two players in the EG, let us represent the set of all compatible plays as a forest. 
Its nodes are labeled by elements of D on even levels and elements of T on odd levels, and its 
edges are labeled by symbols in E. The label of a node q is denoted £{q)\ the sequence of labels 
on the path reaching q from the appropriate root in the forest is referred to as its address a(q). 
The forest F is defined inductively as follows: 

• F has M root nodes labeled by di ,..., d/vr; 

• all the outgoing edges of a node q labeled d g D carry the symbol a = a(a(q)) and the sons 
of the node q correspond to (and are labeled by) the elements of {t | ( d , a, () £ A}; 

• all the outgoing edges of a node q labeled t g T carry the symbol b = r(a(q)) and the sons 
of the node q correspond to (and are labeled by) the elements of {d | ( t,a,d ) € A}. 

The payoff of the EG can be characterized in terms of the growth rate of this forest: 

P(cr, r) = limsup |F 2n | 1/n , 

n—> oo 

where F^ denotes the set of nodes of F at the level k. Indeed L(cr, r) is the set of labels of infinite 
paths of F, hence pref (L(cr, r)) is the set of addresses of nodes in F (we use the fact that our 
strategies are required to be non-blocking). To words of length 4n in pref (L(cr, r)) correspond 
addresses of nodes of level 2n, and thus 

limsup |pref 4n (L(cT,T))| 1/n = limsup |F 2n | 1/n 

n—too n—y oo 

as required. 

Let us characterize the number of nodes |F 2n | in terms of matrices. Let the vector £■(”) = 
(. x ..., x^) be such that x^ is the number of nodes labeled by di on 2n-th level of F; similarly 
let y(n't = (y[ n \ ■ ■ ■ ,yffi) be such that y^ is the number of nodes labeled by tj on (2 n + l)-th 
level of F. To relate to x( n ' ) we observe that 

M 

2/j") = ^ \ i q e Fan I = d i A a ( a ( t l)) = a}\c iaJ . 

i= 1 a£E 

Indeed, every node on level 2n with label di and action a generates on the next level a node with 
label tj whenever Ci a ,j = 1. Summing up on all i, a and q we obtain the quantity The 

expression for y can be rewritten as 

M 

Vj n) = X i n) dia c ia,j (17) 

2—1 aGS 

with fi^ = |{q g F 2n | £(q) = di A a(a(q)) = a}| fxf^ (whenever = 0, coefficients can 
be chosen arbitrarily, only respecting conditions (fT51) below). Intuitively, /i.-”' 1 is the proportion 
among the states di on level 2 n, of those for which Despot takes the action a. In matrix form C3 
can be rewritten as y^ = x^A n with A n> ij = ^ia c ia,j- We notice that 

dia ] ^ 0 and ^ /x|" } = 1, (18) 

aes 

thus i-th row of A n belongs to conv(j^), hence A n g conv(^/). Similarly, x.( n+1 ' ) = y^E n for 
some E n g conv(<f). Initially x ^ = (1,..., 1), and clearly |F„| = x^ • (1,..., 1) T , hence 

|F 2ra | = (!,..., 1)AqEqAiE\ ■ ■ ■ A„_i£’ n _i(l,..., 1) T = ||Ao-EoAi.Ei • • • A n _\E n _x\\. 
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Taking in the MMG over (conv(j^)), conv(d’)) the strategies and 9, which choose matrices 
Aq 1 Eq, Ai, Ei,... we obtain the required: 

-Peg(< 7 ,r) = limsup \F 2n \ 1,n = limsup \\A 0 E 0 AiEi ■ ■ ■ A n -iE n -i\\ 1/n = Pmmg{^,9). 

n—t oo n—>• oo 


It is easy to see that for positional a our construction gives A n = A a for all n. 


□ 


Proof of LemmaWTl As stated in [13j, Corollary 8.1.29], for any nonnegative matrix A and u > 0 

au ^ Au ^ f3u => a ^ p(A) ^ /3, (19) 


our statement (i) is now immediate. Let us prove the three remaining assertions. 

(ii) Let Au ^ pu for u > 0 with A > 0 and Au A pu. Then at least one coordinate of the vector 
Au — pu ^ 0 is strictly negative. Therefore the condition A > 0 implies strict negativity of all 
coordinates of the vector A(Au — pu). Then there exists e > 0 such that A(Au — pu) ^ —eu 
and therefore A 2 u = A(Au — pu) + pAu ^ (p 2 — e)u. Then, by m, we get p(A 2 ) < p 2 — e, 
and thus p{A) < \Jp 2 — e < p, q.e.d. 

(iii) The condition Au ^ pu with non-zero u ^ 0 implies A n u ^ p n u for any n ^ 1. Then 
||A ra || • ||u|| ^ ||A"u|| ^ p ra ||u||. Therefore ||A n || > p n , and by Gelfand’s formula p{A) ^ p, 
q.e.d. 

(iv) Now let A > 0 and Au 7 ^ pu. Then at least one coordinate of the vector Au — pu ^ 0 is 

strictly positive. Therefore the condition A > 0 implies strict positivity of all the coordinates 
of the vector A(Au — pu). Then there exists e > 0 such that A(Au — pu) ^ eu and therefore 
A 2 u = A(Au — pu) + pAu ^ (p 2 + e)u. This, by (iii) applied to the matrix A 2 , implies 
p(A 2 ) > p 2 + e, and thus p(A) > p 2 + e > p, q.e.d. □ 

Proof of Lemma 1111 To prove (i), we represent the vectors u and v in coordinate form: 


u = (m,U2, ■ ■ ■ ,um) t , v = (vi,v 2 ,. ■. ,vn) t - 


Suppose that for some matrix A = (a-ij) £ stf the inequality Au ^ v fails. Then 


auui + a i 2 u 2 H-b a iM UM < 

for some i £ {1,2,..., IV}; we may assume % = 1 without loss of generality. In this case, the matrix 



an 

a 12 

oim 

021 

d 22 

a2M 

djvi 

0.N2 ■ ■ 

CLNM 


obtained from the matrix A = (dij) replacing the first row by a 1 = [an, ai 2 , • ■ •, cJim], yields 
the inequalities a^Ui + a,i 2 u 2 + • • • + ai mum < Vl! and dnUi + di 2 u 2 + • • • + cl^mUm = for 
i = 2 , 3,..., N. Consequently, Au ^ v and Au 7 ^ v, which completes the proof of the first statement 
of the lemma. The proof of statement (ii) is similar. □ 

Proof of Corollary [lH We denote V = mm fsA ,88) and V' = mm(conv(,e/), conv(^)). Then 

1 2 3 . 

V' = min max p(BA) ^ min max p(BA) = min maxp(BA) = V, 

AGconv(^) SGconv(^) conv(^) 

where 1 follows from the equality p(AB) = p(BA), 2 from the inclusion s8 C conv(g/), 3 from 
Lemma Q] and equalities ([3} . Symmetrically, 

V' = max min p(AB) ^ max min p(AB) = max min p(AB) = V. □ 

SGconv(^) AGconv(^) A(fz conv(,c/) A$Ls4 
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Proof of Lemma \15\ Let o' and r' be arbitrary strategies in the EG, then by Lemma [7] for the 
strategy pair (o',t) there is a corresponding pair (g',E T ) with some strategy having the same 
value in the MMG. Symmetrically for the pair (a, t') there is a corresponding pair (A a ,6'). We 
have: 


PW,T) = PW,E t ) < P(A a ,E T ) = P(o,t) 


P(A r7 ,E T )^P(A a ,9 , )=P(o,T'), 


where the equalities come from Lemma 0 and the inequalities from the optimality of E T and A a , 
respectively. Thus a and r are optimal. □ 

We will need the following result in order to prove Lemma [171 

Lemma A.3. Let si be a compact IRU-set of positive (N x N)-matrices. 

(i) If A £ si is a matrix satisfying p(A) = p{si) and v is its positive eigenvector corresponding 
to the eigenvalue p{A), then Av ^ p(si)v for all A £ si. 

(ii) If A £ si is a matrix satisfying p(A) = p(si) and v is its positive eigenvector corresponding 
to the eigenvalue p(A), then Av ^ p(si)v for all A £ si. 

Proof of Lemma \A.A To prove (i) let us note that Av = p(si)v. Then by Lemma HIT i) either 
Av ^ p(si)v for all A £ si or there exists a matrix A £ si such that Av i p{si)v and Av p(si)v. 
In the latter case, by Lemma ITOl the inequality p{A) < p{si) would hold, which contradicts to the 
definition of p(si). Hence, the inequality Av ^ p{si)v holds for all A £ si, q.e.d. Assertion (ii) is 
proved similarly. □ 

Proof of Lemma m For positive matrices, implications <= follow from Lemma 1101 As for =£-, it 
suffices to take v the eigenvector (corresponding to the spectral radius) of the matrix A £ si with 
the largest (smallest) spectral radius, and to apply Lemma [A~TH 
As for non-negative matrices, we have four implications to prove: 

m, => Denote, for any e > 0, si e = {A + el | A £ si}. If p(si) < a then due to compactness 
of the set si there exists e > 0 such that p{si e ) = p{&i + el) < a. Then by (fT2l) (already 
proved for positive matrices), there exists v > 0 such that (A + sl)v < av for all A £ si. 
Since Av < (A + el)u, then Av < av for all A £ si, q.e.d. 

4= Suppose there exists v > 0 such that Av < av for all A £ si. Then due to compactness 
of the set si there exists e > 0 such that (A + el)v < av for all A £ si. Therefore by (1121) 
(for positive matrices) p(si + el) < a, and hence by the monotonicity of the spectral radius 
we obtain p[si) < a, q.e.d. 

=> Let p(si) ^ a, then by the monotonicity of the spectral radius it holds that p(sf +eT) ^ a 
for any e > 0. Then by (TT5l) (for positive matrices) for any e > 0 exists a vector v £ > 0 such 
that ||u E || = 1 and 

(A + sl)v e ^ av e (20) 

for all A £ si. Choose a sequence e n —> 0 for which the corresponding vectors v En converge 
to some vector v ^ 0 (let us point out that ||u|| = 1 and so it is non-zero). Then passing to 
the limit in (|2U)| we obtain Av ^ av for all A £ si, q.e.d. 

m, <= Suppose there exists a non-zero vector v ^ 0 such that Av ^ av for all A £ si. Then by 
Lemma fTTTl p(A) ^ a for all A £ si and hence p(si) > a, q.e.d. □ 


mi, 


(HU, 
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